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Abstract —In this paper, we consider the performance of the 
Noisy Gradient Descent Bit Flipping (NGDBF) algorithm under 
re-decoding of failed frames. NGDBF is a recent algorithm that 
uses a non-deterministic gradient descent search to decode low- 
density parity check (LDPC) codes. The proposed re-decode pro¬ 
cedure obtains improved performance because the perturbations 
are independent at each re-decoding phase, therefore increasing 
the likelihood of successful decoding. We examine the benefits of 
re-decoding for an LDPC code from the IEEE 802.3an standard, 
and find that only a small fraction of re-decoded frames are 
needed to obtain significant performance benefits. When re¬ 
decoding is used, the NGDBF performance is very close to a 
benchmark offset min-sum decoder for the 802.3an code. 


I. Introduction 

A considerable amount of research has been done in 
the past decade to reduce the complexity of soft-decision 
algorithms for Low Density Parity Check (LDPC) codes. The 
class of Weighted Bit Flipping algorithms (WBF) provide a 
good trade-off between performance and complexity ID- The 
WBF algorithms employ an inversion function at the symbol 
nodes to decide on the bits that needed to be flipped during 
a decoding iteration. Many variants to the WBF algorithms 
have been proposed J2} (3). One such variant is Gradient 
descent bit-flipping (GDBF) algorithm J4j. The GDBF algo¬ 
rithm formulates the decoding problem as a gradient descent 
search problem and provides good performance with very low 
complexity. The major drawback of the GDBF algorithms is 
that they tend to be trapped in spurious local maxima. In the 
context of LDPC decoding, local maxima can be interpreted 
as trapping sets which prevent the decoder from converging 
onto a valid codeword 0. 

The recently proposed Noisy GDBF (NGDBF) algorithm 
provides superior performance by injecting random noise into 
the GDBF inversion function (6). The noise perturbation is hy¬ 
pothesized to disrupt the activity around trapping sets, so that 
the decoder has a chance to escape. In this work, we observe 
that failed NGDBF frames can often be correctly decoded by 
re-decoding the algorithm from its initial state. Re-decoding is 
shown through simulations to be more effective than increasing 
the iterations. To explain this result, we hypothesize that the 
noise perturbations may sometimes stimulate new trapping 
set conditions that are less likely to be escaped. Because the 
NGDBF algorithm is non-deterministic, these conditions will 
not necessarily recur when decoding is performed a second 
time. 


Redecoding has been previously examined for non- 
deterministic decoding algorithms 0-|[9). Tehrani et al. 0 
speculated that re-decoding provides an opportunity for a 
stochastic decoder to evade the dominant (8,8) trapping set in 
the 802.3an LDPC code. In this work, we examine the same 
hypothesis applied to the NGDBF algorithm. 

The remainder of this paper is organized as follows: Section 
II reviews the GDBF and NGDBF algorithms. Section III 
describes the proposed re-decoding method and Section IV 
details the simulation results. Conclusions are drawn in Section 
V. 

II. Background 

A. Notation: 

We denote our parity check matrix by H, which is a binary 
matrix with dimensions to x n\ n > m > 1; {m, n} G Z + . H 
is a sparse matrix, and H tj = 1, corresponds to a connection 
between the i* parity check node and j th symbol node; i G 
[1 to], j G [1 n}. The set of codewords associated to H is 
C = {c G {0,1}" : He = 0}. For AWGN transmission, C 
is mapped to C = c. G {1, —1}". The AWGN transmission is 
defined by y = c + z. Here, y is the received vector at the 
receiver, and z is an n dimensional vector of independent and 
identically distributed Gaussian random noise; with zero mean 
and variance = Nq/2, where Nq is the noise spectral density. 

The neighborhood of a node refers to all the other nodes 
that are connected to it. In this paper, we denote the neighbor¬ 
hood of the i th check node by A f(i) = {j : /i i? = 1}, and the 
neighborhood of the j* symbol node by A4(j) = {i : hij = 
1}; where hij is an element of the parity check matrix H. 

The decision vector at each iteration is denoted by x G 
{ —1,1}". If x is a valid codeword and iGC, all the parity 
checks would be satisfied. The T h parity check operation is 
denoted by Si = rijeA/X*) J ' : >' Therefore, a satisfied parity 
check equation refers to s t = 1. If at iteration t, all the parity 
checks are satisfied, then x(t) is declared as a valid codeword 
and decoding is terminated. In all algorithms, unsuccessful 
frames are terminated after a maximum number of iterations 
T. 

B. The GDBF algorithm and NGDBF algorithms 

Wadayama et al. proposed the GDBF algorithm for de¬ 
coding LDPC codes HI- Several variants of GDBF have 


been described; in this work we consider only the fully- 
parallel multi-bit GDBF algorithm. The NGDBF algorithm, 
described by Sundararajan et al. a modifies GDBF by adding 
a pseudo-random perturbation to the inversion function at each 
symbol node during each iteration. The perturbations consist of 
identical and independently distributed Gaussian random noise 
samples qk with zero mean and variance equal to a 2 = q 2 No/2 
— proportional to the variance of the channel noise — where 
q > 0 is a noise scale parameter. The resulting inversion 
function is 

E k = XkVk + w k 53 Si + q k , 

ieM(k) 

which describes GDBF when q = 0 and w = 1. For each 
symbol node index k, the corresponding decision Xk is flipped 
if Ek < 6, where 6 G M _ is an empirically determined 
threshold value. 

In order to obtain the best performance, NGDBF makes 
use of additional heuristics. The heuristics used in this work 
include output smoothing, in which a sliding-window average 
is taken on the output decisions, and a threshold adaptation 
procedure devised by Ismail et al. tm With adaptive thresh¬ 
olding, each symbol has a local threshold 6k which is adjusted 
dynamically. In J6l, this combination of heuristics is referred 
to as SM-NGDBF. Throughout this paper, any references to 
NGDBF shall refer to the algorithm described as follows: 

Step 0: Initialize 6k (f = 0) = 6 for all k, where 6 is 
the global initial threshold parameter. Optionally 
saturate sample magnitudes at t/ max and set x = 
sign (y). 

Step 1: Compute syndrome components: 

Si = || Xj . (1) 

ieN(i) 

for all i £ {1, 2,...., to}. If Si = +1 for all i, 
output x and stop. 

Step 2: Compute inversion functions: 

E k = XkVk + w k 5Z Si + qk ^ 

ieAt(fc) 

for k £ { 1 , 2 ,..., n}. where Wk is a syndrome 
weight parameter and qk is a Gaussian distributed 
random variable with zero mean and variance 
a 2 = q 2 No/2, where 0 < q < 1. All qk are 
independent and identically distributed. 

Step 3: Bit-flip operations: if Ek ( t ) < 6k ( t ) then 
Xk (t + 1) = —Xk (f), otherwise 6k (t + 1) = 
A 6k ( t ), where A is a global threshold adaptation 
parameter for which 0 < A < 1. 

Step 4: Repeat steps 1 to 3 until all Si = 1, for i = 
1, 2, ..., to, or maximum number of iterations 
T is reached. 

In this work we make use of the smoothing heuristic where an 
up/down counter is placed at the output of each symbol. Each 
counter is initialized at zero, and updated as follows: 

+ 1) = X'kit) + x k [t) (3) 

If all the parity checks are satisfied by Xk, then Xk is declared 
as the final result; otherwise the smoothed decision is used, 

x k = sign(A' fe ). 


III. Re-decoding and Trapping Sets 

Re-decoding from the same initial condition was consid¬ 
ered previously for stochastic decoders by Tehrani et al. 0, 
who proposed it as a method to evade trapping sets. In that 
paper, the authors demonstrated BER improvement due to re¬ 
decoding, but did not provide a detailed inspection of trapping 
set behavior. In this section, we present some experiments on 
the (8,8) absorbing set known to be dominant in the 802.3an 
10GBASE-T standard LDPC code under belief propagation 
m. The induced graph for this set is shown in Fig. Q] where 
the degree-one check nodes are indicated as ■, degree-two 
check nodes as □, and symbol nodes as •. While a full trapping 
set analysis has not yet been developed for NGDBF, in this 
section we verify that the (8,8) set acts as a trapping set for 
GDBF and NGDBF, and we inspect the dynamics that allow 
NGDBF to evade the trapping set during decoding. 

To investigate NGDBF dynamics on this absorbing set, a 
localized simulation was performed on the (8,8) subgraph. 
The correct state is assumed to be c = (+1 +1 • • • + 1). The 
GDBF and NGDBF algorithms were simulated with identical 
inputs y = c + z, where z is a vector of zero-mean Gaussian 
noise samples with a = 1. The simulations were performed 
with parameters A = 1 (i.e. without threshold adaptation), 
w = 1, T = 100 and for NGDBF q = 1. In these simulations, 
a frame was considered successful if the correct result, x = c, 
was obtained for at least one iteration. Failed frames were 
saved for detailed inspection. 

It was found that failed frames typically begin in a 
metastable initial condition, where one or two early flips 
determine the ultimate trajectory. Fig. [2] shows the trajectory of 
inversion functions for a case in which GDBF becomes trapped 
in an oscillating cycle, but NGDBF avoids the oscillation due 
to a fortuitous early flip. In the early iterations, some of the 
Ek are negative or weakly positive, so they are likely to be 
flipped. In later iterations, most of the NGDBF Ek values are 
strongly positive, so additional flips are unlikely in spite of the 
noise perturbations. 

Fig. [3] shows a case where NGDBF failed due to an errant 
early flip. In this case, NGDBF eventually converged on an all¬ 
error state with positive Ek . Because the Ek are positive, future 
flips are unlikely to occur and the error state is effectively 
stable. Fig.[4]shows a repeated simulation from the same initial 
condition. In this case, NGDBF made different flips in the first 
five iterations, and converged on the correct state. This example 
demonstrates advantages of re-decoding from the same initial 
state, which cannot be achieved by extending the simulation 
time. The benefits of re-decoding are more pronounced when 
threshold adaptation is used, since the evolving thresholds tend 
to harden the stability of the final state, thereby lowering the 
probability that NGDBF will escape to the correct state if given 
more iterations. 

These experiments were repeated for several values of a, 
and NGDBF was found to have a consistently lower rate 
of converging on an erroneous state compared to GDBF. 
The simulation method used here is illuminating about the 
dynamics and provides motivation for re-decoding, but it is 
not sufficient to quantify the frame error probability associated 
with this absorbing set. For cr < 0.7, we did not obtain 
any failed cases for NGDBF. In the sequel, we evaluate the 
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Fig. 1: The dominant ( 8 , 8 ) absorbing set in the 802.3an 
10GBASE-T LDPC code. 



Fig. 3: A case where NGDBF settles on an all-error pattern 
on the ( 8 , 8 ) absorbing set. Error propagation is triggered by 
a single errant bit-flip that occurs in the first five iterations. 



Fig. 2: A typical case where GDBF is trapped but NGDBF 
escapes due to random perturbations. 



Fig. 4: A re-decoded case with the same initial conditions 
as Fig. [3 This time NGDBF evades the erroneous state and 
corrects all errors. 


re-decoding method for two practical codes, and show that 
significant performance benefits are obtained. 

IV. Simulation results 

Simulations were performed using re-decoding with 
NGDBF for two codes: the rate 1/2 regular (3,6) LDPC code 
identified as PEGReg504xl008 in MacKay’s online encyclope¬ 
dia of sparse graph codes Ifl2l and the rate 0.8413 LDPC code 
defined in IEEE 802.3 standard. The smoothing and threshold 
adaptation heuristics are used for the PEGReg504xl008 code, 
and the algorithm name is indicated as SM-NGDBF in reported 
simulations. For the 802.3 code, these heuristics were not used. 
Each frame was allowed to be simulated up to a maximum 
number $ of re-decoding phases. At least 200 bit errors and 
at least 20 word errors were observed to obtain the BER 
measurements for each simulation. 

For the PEGReg504xl008 code, SM-NGDBF was simu¬ 
lated with parameters T = 300, w = 0.816, A = 0.98, initial 


threshold 9 = —0.6 and noise scale 77 = 0.75. Fig. [5] shows 
BER performance results for the PEGReg504xl008 code with 
<I> = 10. The re-decoding technique has significant gain when 
applied to the SM-NGDBF algorithm. At BER= 10 -6 , re¬ 
decoding provides a gain of about 0.5 dB. Output smoothing 
is only used for iterations exceeding (T — 64). Performance 
of NMS and BP algorithms are presented for comparison. 
The BER improves with higher values of <1>, but there is 
a diminishing benefit as $ is increased. As seen in Fig. [ 6 ] 
there is a rapid improvement in performance from = 1 
through $ = 5. As $ is increased further, the improvement 
in BER performance becomes less significant. There is slight 
improvement in <T> = 10 compared to the improvement in the 
earlier phases. 

The simulations show that re-decoding is necessary for 
a small fraction of frames. Fig. [7| shows the distribution of 
re-decoding phases to complete decoding. Most of the failed 






























frames that are not corrected in the first decoding phase are cor¬ 
rected by the second phase. The frames that are not corrected 
by the second phase are passed onto the third phase and so 
forth. Finally, at the last phase, the accumulated failed frames 
determine the word error rate (WER). This accumulation is 
evident in the last phase in Fig. [7] 

Fig. [8] shows the BER performance results for NGDBF 
on the IEEE 802.3 standard LDPC code. Smoothing is not 
used for simulations with this code, because no significant 
improvement was achieved using smoothing in this case. The 
simulation parameters are T = 1000, w = 0.20833, A = 1, 
9 = —0.525, and rj = 0.92. Since A = 1, threshold adaptation 
is not required for this code. To evaluate the performance we 
use a recently reported 802.3 Offset Min-Sum (OMS) decoder 
as a benchmark lfT3l . Re-decoding provides a gain of 0.25 dB 
for this code. The BER performance of NGDBF is very close 
to the benchmark OMS decoder. 

Since re-decoding incurs a substantial latency penalty, we 
examined the average latency on the 802.3 code. If buffering 
can be tolerated by the end application, then the average 
latency penalty is very small since nearly all frames are 
successfully decoded in the first phase. Fig. 0 shows the 
average latency in terms of clock cycles for all simulated cases, 
in comparison to the reported latency of the OMS decoder. The 
OMS decoder uses a semi parallel layered architecture which 
requires 12 clock cycles to complete an iteration. The reported 
average number of iterations is therefore scaled by 12 to obtain 
the average latency depicted in Fig. [7] The NGDBF decoder 
has a much lower complexity than the OMS algorithm, so 
layering is not required and we can expect every iteration to 
complete in a single clock cycle. The number of iterations is 
therefore equivalent to the latency in the NGDBF case. Fig. [9] 
shows that the average latency for NGDBF is quite large at low 
SNR values, but decreases at higher SNRs where it has a lower 
latency than OMS. When operating at higher SNR values, re¬ 
decoded NGDBF offers better performance and lower average 
latency than the benchmark design. To account for consecutive 
worst case frames, the NGDBF decoder would require a larger 
frame buffer compared to the OMS decoder. This is a potential 
drawback of re-decoding with the NGDBF algorithm. 

V. Conclusion 

The NGDBF algorithm has been modified by the appli¬ 
cation of the re-decoding method described in this paper. 
NGDBF is a low complexity algorithm compared to stochastic 
decoding, OMS, NMS or BP algorithms. Re-decoding takes 
advantage of the inherent random nature of NGDBF and 
further enhances its performance. Re-decoding provides a gain 
of up to 0.5 dB over the original NGDBF algorithm for the 
codes examined in this paper. For the IEEE 802.3 standard 
LDPC code, re-decoding yields performance very close to a 
benchmark OMS decoder design and requires less average 
latency. 

The potential drawbacks to re-decoding include high la¬ 
tency when operating at low SNRs. Frame buffering is also 
needed to accommodate the additional delay required for re¬ 
decoding unsuccessful frames. The applicability of re-decoding 
is application dependent, since it may not be possible to delay 
frame delivery. The buffering requirement is relaxed at higher 
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Fig. 5: BER results for the PEGReg504xl008 code. Results 
for belief propagation (BP) and normalized min-sum (NMS) 
with different iterations are provided for comparison. 



Fig. 6: BER for re-decoding with the SM-NGDBF on the 
PEGReg504xl008 code for different <l's. 


SNRs, as a smaller fraction of frames utilize the re-decoding 
phases. Future research can be done to optimize the buffer size 
and frame scheduling for consecutive worst case frames at low 
SNRs. It would also be interesting to quantify absorbing set 
contributions to the error rate in NGDBF, where re-decoding 
may offer benefits in the error-floor region. 
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Fig. 7: A histogram showing the fraction of frames com¬ 
pleted at each decoding phase for SM-NGDBF on the PE- 
GReg504xl008 code, <1» = 10. The increase of frames at the 
last phase arises due to the accumulation of failed frames. 



Fig. 8: BER for re-decoded NGDBF compared to a benchmark 
OMS decoder for the IEEE 802.3 standard LDPC code. 


Catalyst grant from Utah State University. We also thank Dr. 
Emmanuel Boutillon and lab-STICC at UBS (Universite de 
Bretagne Sud), Lorient, France, for their hospitality during the 
early phase of this work. 

References 

[1] Y. Kou, S. Lin, and M. Fossorier, “Low density parity check codes 
based on finite geometries: a rediscovery,” in Information Theory, 2000. 
Proceedings. IEEE International Symposium on, 2000, p. 200. 


[2] X. Wu, C. Zhao, and X. You, “Parallel weighted bit-flipping decoding,” 
Communications Letters, IEEE, vol. 11, no. 8, pp. 671-673, Aug. 2007. 



2 3 4 5 

SNR 


Fig. 9: Latency comparison between different algorithms and 
codes. The dashed lines indicate simulations on the PE- 
GReg504xl008 code, and solid lines indicate simulations on 
the IEEE 802.3 standard LDPC code. 


[3] J. Zhang and M. P. C. Fossorier, “A modified weighted bit-flipping 
decoding of low-density parity-check codes,” Communications Letters, 
IEEE, vol. 8, no. 3, pp. 165-167, Mar. 2004. 

[4] T. Wadayama, K. Nakamura, M. Yagita, Y. Funahashi, S. Usami, and 
I. Takumi, “Gradient descent bit flipping algorithms for decoding LDPC 
codes,” Communications, IEEE Transactions on, vol. 58, no. 6, pp. 
1610-1614, 2010. 

[5] T. Richardson, “Error-floors of LDPC codes,” in Proceedings of the 
41st Annual Conference on Communication, Control and Computing, 
2003, pp. 1426-1435. 

[6] G. Sundararajan, C. Winstead, and E. Boutillon, “Noisy gradient descent 
bit-flip decoding for decoding LDPC codes,” Communications, IEEE 
Transactions on, in press 2014b. 

[7] S. Sharifi Tehrani, A. Naderi, G.-A. Kamendje, S. Hemati, S. Mannor, 
and W. J. Gross, “Majority-based tracking forecast memories for 
stochastic LDPC decoding,” Signal Processing, IEEE Transactions on, 
vol. 58, no. 9, pp. 4883^1896, 2010. 

[8] F. Leduc-Primeau, S. Hemati, S. Mannor, and W. Gross, “Dithered 
belief propagation decoding,” Communications, IEEE Transactions on, 
vol. 60, no. 8, pp. 2042-2047, 2012. 

[9] K. Cushon et al., “High-throughput energy-efficient LDPC decoders 
using differential binary message passing,” Signal Processing, IEEE 
Transactions on, vol. 62, no. 3, pp. 619-631, Feb 2014. 

[10] M. Ismail, I. Ahmed, J. Coon, S. Armour, T. Kocak, and J. McGeehan, 
“Low latency low power bit flipping algorithms for LDPC decoding,” 
in Personal Indoor and Mobile Radio Communications (PIMRC), 2010 
IEEE 21st International Symposium on. Sept. 2010, pp. 278 -282. 

[11] S. Zhang and C. Schlegel, “Controlling the error floor in LDPC 
decoding,” Communications, IEEE Transactions on, vol. 61, no. 9, pp. 
3566-3575, September 2013. 

[12] D. J. C. MacKay. Encyclopedia of sparse 
graph codes. Accessed: 2014-06-20. [Online]. Available: 
http ://'www. inference. phy. cam. ac. uk/mackay/code s/data, html 

[13] Z. Zhang et al., “An efficient 10GBASE-T ethemet LDPC decoder 
design with low error floors,” IEEE J. Solid-State Circ., vol. 45, pp. 
843-855, Apr. 2010. 






















































































